Cost-based analyses of random neighbor and derived sampling methods

نویسندگان

چکیده

Abstract Random neighbor sampling, or RN , is a method for sampling vertices with mean degree greater than that of the graph. Instead naïvely vertex from graph and retaining it (‘random vertex’ RV ), selected instead. While considerable research has analyzed various aspects extra cost second typically not addressed. This paper explores perspective cost. We break down into two distinct costs, an already sampled vertex, we also include actually selecting vertex/neighbor use rather discarding it. With these three costs as our cost-model, explore compare to in more fair manner comparisons have been made previous research. As delve number variants are introduced. These improve on cost-effectiveness regard particular priorities. Our full cost-benefit analysis highlights strengths weaknesses methods. particularly focus how methods perform high-degree low-degree vertices, which further enriches understanding they can be practically applied. suggest ‘two-phase’ specifically seek cover both separate phases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of Chemical Named Entity Recognition through Sentence-based Random Under-sampling and Classifier Combination

Chemical Named Entity Recognition (NER) is the basic step for consequent information extraction tasks such as named entity resolution, drug-drug interaction discovery, extraction of the names of the molecules and their properties. Improvement in the performance of such systems may affects the quality of the subsequent tasks. Chemical text from which data for named entity recognition is extracte...

متن کامل

Towards Cost-efficient Sampling Methods

The sampling method has been paid much attention in the field of complex network in general and statistical physics in particular. This paper presents two new sampling methods based on the perspective that a small part of vertices with high node degree can possess the most structure information of a network. The two proposed sampling methods are efficient in sampling the nodes with high degree....

متن کامل

Evaluation Accuracy of Nearest Neighbor Sampling Method in Zagross Forests

Collection of appropriate qualitative and quantitative data is necessary for proper management and planning. Used the suitable inventory methods is necessary and accuracy of sampling methods dependent the inventory net and number of sample point. Nearest neighbor sampling method is a one of distance methods and calculated by three equations (Byth and Riple, 1980; Cotam and Curtis, 1956 and Cota...

متن کامل

Evaluation Accuracy of Nearest Neighbor Sampling Method in Zagross Forests

Collection of appropriate qualitative and quantitative data is necessary for proper management and planning. Used the suitable inventory methods is necessary and accuracy of sampling methods dependent the inventory net and number of sample point. Nearest neighbor sampling method is a one of distance methods and calculated by three equations (Byth and Riple, 1980; Cotam and Curtis, 1956 and Cota...

متن کامل

Sampling Methods for Random Subspace Domain Adaptation

Supervised classification tasks like Sentiment Analysis or text classification need labelled training data. These labels can be difficult to obtain, especially for complicated and ambiguous data like texts. Instead of labelling new data, domain adaptation tries to reuse already labelled data from related tasks as training data. We propose a greedy selection strategy to identify a small subset o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied Network Science

سال: 2022

ISSN: ['2364-8228']

DOI: https://doi.org/10.1007/s41109-022-00475-x